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Abstract 

We prove that the property of being closed (resp., palindromic, rich, privileged 
trapezoidal, balanced) is expressible in first-order logic for automatic (and some re¬ 
lated) sequences. It therefore follows that the characteristic function of those n for 
which an automatic sequence x has a closed (resp., palindromic, privileged, rich, trape¬ 
zoidal, balanced) factor of length n is automatic. For privileged words this requires a 
new characterization of the privileged property. We compute the corresponding char¬ 
acteristic functions for various famous sequences, such as the Thue-Morse sequence, 
the Rudin-Shapiro sequence, the ordinary paperfolding sequence, the period-doubling 
sequence, and the Fibonacci sequence. Finally, we also show that the function count¬ 
ing the total number of palindromic factors in a prefix of length re of a /^-automatic 
sequence is not ^-synchronized. 


1 Introduction 

Recently a wide variety of different kinds of words have been studied in the combinatorics on 
words literature, including the six flavors of the title: closed, palindromic, rich, privileged, 
trapezoidal, and balanced words. In this paper we show that, for /c-automatic sequences x 
(and some analogs, such as the so-called “Fibonacci-automatic” sequences [17]), the property 
of a factor belonging to each class is expressible in first-order logic; more precisely, in the 
theory Th(U, +,n ^ Previously we did this for unbordered factors [20]. 

As a consequence, we get that (for example) the characteristic sequence of those lengths 
for which a factor of that length belongs to each class is /^-automatic, and the number of 
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such factors of each length forms a /c-regular sequence. (For definitions, see, for example, 

[ 2 ].) 

Using an implementation of a decision procedure for first-order expressible properties, 
we can give explicit expressions for the lengths of factors in each class for some famous se¬ 
quences, such as the Thue-Morse sequence, the Rudin-Shapiro sequence, the period-doubling 
sequence, and the ordinary paperfolding sequence. For some of the properties, these expres¬ 
sions are surprisingly complicated. 


2 Notation and definitions 

As usual, if tc = xyz, we say that x is a prefix of tc, that .2 is a suffix of tc, and ?/ is a factor 
of tc. By |x|i„ we mean the number of (possibly overlapping) occurrences of tc as a factor 
of X. For example, |confrontation|on = 3. By x^ we mean the reversal (sometimes called 
mirror image) of the word x. Thus, for example, (drawer)^ = reward. By we mean the 
alphabet {0,l,...,/c — 1} of cardinality k. 

A factor tc of x is said to be right-special if both wa and wb are factors of x, for two 
distinct letters a and b. 

A word X is a palindrome if x = x^. Examples of palindromes in English include radar 
and redivider. Droubay, Justin, and Pirillo [16] proved that every word of length n contains 
at most n + 1 distinct palindromic factors (including the empty word). A word is called rich 
if it contains exactly this many. For example, the English words logology and Mississippi 
are both rich. For example, Mississippi has the following distinct nonempty palindromic 
factors: 

M, i, s, p, ss, pp, sis, issi, ippi, ssiss, ississi. 

For more about rich words, see [19, 15, 7, 5]. 

A nonempty word tc is a border of a word x if tc is both a prehx and a suffix of x. A 
word X is called closed (aka “complete hrst return”) if it is of length < 1, or if it has a 
border w with |x|^ = 2. For example, abracadabra is closed because of the border abra, 
while alfalfa is closed because of the border alfa. The latter example shows that, in the 
dehnition, the prehx and suffix are allowed to overlap. For more about closed words, see [3]. 

A word X is called privileged if it is of length < 1, or it has a border w with \x\yj = 2 
that is itself privileged. Clearly every privileged word is closed, but mama is an example of 
an English word that is closed but not privileged. For more about privileged words, see 
[23, 24, 25, 18]. 

A word X is called trapezoidal if it has, for each n > 0, at most n -|- 1 distinct factors of 
length n. Since for n = 1 the dehnition requires at most 2 distinct factors, this means that 
every trapezoidal word can be dehned over an alphabet of at most 2 letters. An example in 
English is the word deeded. See, for example, [14, 13, 15, 6]. 

A word X is called balanced if, for all factors y, z of the same length of x and all letters a of 
the alphabet, the inequality |||/|a — \z\a\ < 1 holds. Otherwise it is unbalanced. An example 
of a balanced word in English is banana. 
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We use the terms “infinite sequence” and “infinite word” as synonyms. In this paper, 
names of infinite words are given in the bold font. All infinite words are indexed starting at 
position 0. If x = xqXiX 2 ■ ■ ■ is an infinite word, with each Xi a single letter, then by x[i..j] 
for j >i — l we mean the finite word XiXi+i ■ ■ - Xj. By [i..j] we mean the set {i, z + 1,..., j}. 

3 Sequences 

In this section we define the five sequences we will study. For more information about these 
sequences, see, for example, [2]. 

The Thue-Morse sequence t = ■ • ■ = 01101001 ■ ■ ■ is defined by the relations to = 0, 

t 2 n = tn, and t 2 n+i = 1 — H IS also expressible as the fixed point, starting with 0, of the 
morphism /i : 0 —?■ 01, 1 10. 

The Rudin-Shapiro sequence r = rorir 2 • • ■ = 00010010-■■ is defined by the relations 

^0 0) ^2n ^ni ^4n+l ^8n+7 ^2n+l) ^16n+3 ^8n+3) ^16n+ll ^4n+3- H is alsO 

expressible as the image, under the coding r : n —?■ [n/2j, of the fixed point, starting with 
0, of the morphism p : 0 —)■ 01, 1 —)■ 02, 2 —)■ 31, 3 —?■ 32. 

The ordinary paperfolding sequence p = poPiP 2 --- = 00100110-is defined by the 
relations po = 0, p 2 n+i = Pm Pau = 0, p^n+i = 1- If is also expressible as the image, under 
the coding r above, of the fixed point, starting with 0, of the morphism p : 0 —01, 1 21, 

2 ^ 03, 3 ^ 23. 

The period-doubling sequence d = dodid 2 - - - = 10111010 - - - is defined by the relations 
do = 1, d 2 n = 1, d 4 n+i = 0, and d^n+s = dn- It is also expressible as the fixed point, starting 
with 1, of the morphism 5 : 1 —)■ 10, 0 —?■ 11. 

The Fibonacci sequence f = / 0 / 1/2 - - - = 01001010 - - - is the fixed point, starting with 0, 
of the morphism p : 0 —?■ 01, 1 —?■ 0. 


4 Common predicates 

Before we see how rich words, privileged words, closed words, etc. can be phrased as first- 
order predicates, let us define a few basic predicates. 

First, we have the two basic predicates lN(z,r, s), which is true iff z G [r..s]: 

In(z, r, s) := (z > r) A (z < s), 

and SUBS(z, j, m, n), which is true iff [z..z -|- m — 1] C [j..j -|- rz — 1]: 

SUBS(z, j, m, n) := (j < z) A (z-|-m < j-|-rz). 

Next, we have the predicate 

FactorEq(z, j, rz) := Vfc (/c < rz) {x.[i k] = x.[j k]) , 

which checks whether x[z..z -|- rz — 1] and x[j..j -|- rz — 1] are equal by comparing them at 
corresponding positions, x[z -|- k] and x[j -|- k], for fc = 0,... , rz — 1. By a similar principle, we 
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can compare + n — 1] with x[j..j + n — 1]^, but in this paper we only need the special 
case i = j, i.e., palindromes: 

PAL(i, n) := \/k {k < n) (x[i + k] = x[i + n — 1 — A:]). 

From FactorEq, we derive other useful predicates. For instance, the predicate 

OccURS(z, j, m, n) := (m < n) A {3k{k + m<n) A FactorEq(*, j + fc, m)) 

tests whether x[i..i + m — 1] is a factor of x[j..j + n — 1]. We also dehne 

BORDER(i,m,n) := lN(m, 1,n) A FACTOREQ(i,i + n — m,m), 

which is true iff x[i..i + m — 1] is a border of x[i..i + n — 1], 

In the next five sections, we obtain our results using the implementation of a decision 
procedure for the corresponding properties, written by Hamoon Mousavi, and called Walnut, 
to prove theorems by machine computation. The software is available for download at 
https://cs.uwaterloo.ca/~shallit/papers.html . 

All of the predicates in this paper can easily be translated into Hamoon Mousavi’s Walnut 
program. Files for the examples in this paper are available at the same URL as above, so 
the reader can easily run and verify the results. 

5 Closed words 

We can create a predicate CLOSED(i,n) that asserts that x[A.i + n — 1] is closed as follows: 

(n < 1) V (3j (j < n) A BORDER(i, j, n) A -'OccURS(i, i + 1, j, n - 2)) 
Theorem 1. (a) There is a closed factor of Thue-Morse of every length. 

(b) There is a 15-state automaton accepting the base-2 representation of those n for which 
there is a closed factor of Rudin-Shapiro of length n. 

(c) There is an 11-state automaton accepting the base-2 representation of those n for which 
there is a closed factor of the paperfolding seguence of length n. It is depicted below in 
Figure 1. 

(d) There is a closed factor of the period-doubling seguence of every length. 

(e) There is a closed factor of the Fibonacci seguence of every length. 
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( 1 ) 



Figure 1: Automaton for lengths of closed factors of the paperfolding sequence 

As we have seen above, the Thue-Morse sequence contains a closed factor of every length. 
We now turn to enumerating f{n), the number of such factors of length n. Here are the hrst 
few values of /(n): 


n 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

fin) 

1 

2 

2 

2 

4 

4 

6 

4 

8 

8 

10 

8 

12 

8 

8 

8 


The hrst step is to create a predicate UCF(i, n) which is true ii + n — 1] is a closed 
factor of t of length n, and is also the hrst occurrence of that factor: 

UCF(*,?7.) := CLOSED(z,n) A -'OccURS(f, 0, n, f + n - 1). 

The associated DFA then gives us (as in [20]) a linear representation for /(n): vectors 
v^w and a matrix-valued homomorphism p : {0,1} such that f{n) = vfi{x)w'^ for 

all X that are valid base-2 representations of n. 

They are as follows (with nii) = Mj): 
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Mo = 


1 1 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 10 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
0 1 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 

0 0 0 

0 0 0 

oil 
0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

oil 
0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

0 0 0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 1 
0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
110 
0 0 1 
0 0 1 
0 0 1 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 10 
10 0 
0 0 1 
10 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

1 

2 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 10 
0 0 1 
0 0 0 
0 0 0 
0 10 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 1 
0 0 0 
0 0 2 


Ml = 


0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


10 0 
0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
1 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
0 0 0 
1 1 0 
0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
110 
0 10 
0 10 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 10 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
10 0 
0 0 0 
0 0 0 
0 10 
10 0 
0 0 0 
0 0 0 
0 10 
10 0 
0 0 0 
0 0 0 
0 10 
0 10 
10 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

1 

0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
0 0 0 
0 0 0 
0 0 0 
0 10 
0 10 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
10 0 
2 0 0 
0 0 0 
0 0 2 
0 0 0 
0 0 2 
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U=[ll00100000000000000000000000000 


l(;=[l01100001000110001000110000100l] 

This linear representation can be minimized, using the algorithm in [4], obtaining 



■ 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 ■ 


0 

0 

1 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

1 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

1 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 

1 

0 


0 

0 

0 

0 

0 

-1 

1 

1 

1/2 

0 


0 

0 

0 

0 

0 

-2 

2 

0 

-3 

4 


0 

0 

0 

0 

0 

0 

0 

2 

4 

-4 


0 

0 

0 

0 

0 

0 

0 

0 

2 

0 


0 

0 

0 

0 

0 

0 

0 

1/2 

11/4 

-1 


■ 0 

1 

0 

0 

0 

0 

0 

0 

0 

0 ■ 


0 

0 

0 

1 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

1 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

1 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 

0 

1 


0 

0 

0 

0 

0 

2 

-2 

-1 

4 

-2 


0 

0 

0 

0 

0 

0 

0 

0 

1 

0 


0 

0 

0 

0 

0 

4 

-4 

0 

10 

-8 


0 

0 

0 

0 

0 

0 

0 

0 

2 

0 


_ 0 

0 

0 

0 

0 

1 

-1 

-1/2 

7/2 

-1 

v ' 

= [ 

1 

0 

0 

0 

0 

0 

0 0 

0 0 


w ' 

= 


2 

2 

2 

4 

4 

6 4 

8 8 



From this, using technique in [20], we can obtain the following relations 


/(8n) 
f { 8 n + l ) 
/(8n + 3) 

/(8n + 4) 

/(8n + 5) 
/(8n + 7) 

/(16n + 2) 

/(16n + 6) 
/(16n + 10) 
/(32n + 14) 
/(32n + 30) 


-2f{2n + 1) + /(4n) + 2/(4n + 1) 

-2/(2n + l) + 3/(4n+l) 

-2f{2n + 1) + 2/(4n + 1) + /(4n + 3) 

2f{2n + 1) - ^f{4n + 1) + /(4n + 2) + ^/(4n + 3) + /(8n + 2) 

2/(4n + 3) 

-4/(2n + 1) + 2/(4n + 1) - 2/(4n + 3) + 2/(8n + 6) 

13 1 

-6/(2n + 1) + y/(4n + 1) + -/(4n + 3) 

+ 1) + /(4n + 2) + ^/(4n + 3) + /(8n + 2) 

2/(4n + 3) + /(8n + 6) 

-2f{2n + 1) - ^/(4n + 1) + 3/(4n + 2) + ^/(4n + 3) + 3/(8n + 2) 

24/(2n + 1) - 6/(4n + 1) + 14/(4n + 3) - 4/(8n + 2) - 12/(8n + 6) + 5/(16n + 14). 
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From these we can verify the following theorem by a tedious induction on n: 
Theorem 2. Let n > 8 and let k > —1 be an integer. Then 


2^+4 

z/ 15 • 2^ < n < 18 

• 2 ^ 

2n-20-2^ - 2 , 

z/ 18 • 2^ < n < 19 

• 2 ^ 

56 ■ 2^ - 2n + 2, 

z/ 19 • 2^ < n < 20 

• 2 ^ 

4n-64-2^-4, 

z/ 20 • 2 ^ < n < 22 

• 2 ^= 

112-2^-4n + 4, 

if 22-2’^ <n <24 

■ 2 ^ 

2^+4 

i/ 24 ■ 2^ < n < 28 

■ 2 ^ 

8 n - 208 - 2 ^- 8 , 

\ 

z/ 28 • 2^ < n < 30 

• 2 ^= 


6 Palindromic words 

Palindromes in words have a long history of being studied; for example, see [1]. 

It is already known that many aspects of palindromes in /c-automatic sequences are 
expressible in first-order logic; see, for example, [ 11 ]. 

In this section, we turn to a variation on palindromic words, the so-called “maximal 
palindromes”. For us, a factor x of an inhnite word w is a maximal palindrome if x is a 
palindrome, while no factor of the form axa for a a single letter occurs in w. This differs 
slightly from the existing definitions, which deal with the maximality of occurrences [ 22 ]. 

The property of being a maximal palindrome is easily expressible in terms of predicates 
dehned above: 

MAXPAL(q?7,) := PAL(i,n) A (Vj ((j > 1) A FactorEq(*, j, n)) x[j — 1] 7 ^ x[j + n]) 
Using this, and our program, we can easily prove the following result: 

Theorem 3. (a) The Thue-Morse sequence contains maximal palindromes of length 3 - AT 

for each n > 0, and no others. These palindromes are of the form u^^fOlO) and 
//^"(lOl) forn> 0. 

(b) The Rudin-Shapiro sequence contains exactly 8 maximal palindromes. They are 

0100010 , 0001000 , 1110111 , 1011101 , 0010000100 , 1101111011 , 1110110111 , 10000100100001 . 

(c) The ordinary paperfolding sequence contains exactly 6 maximal palindromes. They are 

001100 , 110011 , 011000110 , 100111001 , 1000110110001 , 0111001001110 . 

(d) The period-doubling sequence contains maximal palindromes of lengths 3 • 2"' — 1 for all 
n >0, and no others. 


(e) The Fibonacci sequence contains no maximal palindromes at all. 





We now turn to a result about counting palindromes in automatic sequences. To state it, 
we first need to describe representations of integers in base k. By {n)k we mean the string 
over the alphabet := { 0 , 1 ,..., fc — 1 } representing n in base fc, and having no leading 
zeroes. This is generalized to representing r-tuples of integers by changing the alphabet to 
and padding shorter representations on the left, if necessary, with leading zeroes. Thus, 
for example, (6, 8)2 = [1, 0][1,1][0,1]. By [w]k, for a word w, we mean the value of w when 
interpreted as an integer in base k. 

Next, we need the concept of /c-synchronization [10, 8 , 9, 21]. We say a function f{n) is 
k-synchronized if there is a hnite automaton accepting the language {(n, f{n))k '■ n > 0 }. 
The following is a useful lemma: 

Lemma 4. If (/(n))„>o is a k-synchronized sequence, and f 7 ^ 0(1), then there exists a 
eonstant c > 0 such that f{n) > cn infinitely often. 

Proof. Since / 7 ^ 0(1), there exists n > 0 such that f{n) > k^, where N is the number 
of states in the minimal automaton accepting L^, where L = {{n, f{n))k : n > 0}. Apply 
the pumping lemma to the string z = (n,/(n))f. It says that we can write = uvw, 
where \uv\ < n and w has nonzero elements in both components. Then, letting (n*, f{ni)) = 
[{uv'^w)^]k we see that this subsequence has the desired property. □ 

Theorem 5. The function counting the number of distinct palindromes in a prefix of length 
n is not k-synchronized. 

Proof. Our proof is based on two infinite words, a = (ai)i>o and b = ( 6 j)i>o. 

The word a is defined as follows: 


Oji — 


{k mod 2 ) + 1 , 

0 , 


if there exists k such that 4^+^ — <i < + 4^; 

otherwise. 


The word b is dehned as follows: 


h 


{k mod 2 ) + 1 , if there exists k such that 4^+^ — A^ < i < 4^+^ + 4^; 
0 , otherwise. 


We leave the easy proof that a and b are 4-automatic to the reader. 

We now compare the palindromes in a to those in b. From the definition, every palin¬ 
drome in either sequence is clearly in 


0* + 1 * + 2 * + on*o* + 0*2*0*. 


Since a has longer blocks of I’s and 2s than b does, there may be some palindromes of the 
form 1 * or 2 * that occur in a prefix of a, but not the corresponding prefix of b. Conversely, 
b may contain palindromes of the form 0 * that do not occur in the corresponding prehx of 

a. 

Call an occurrence of a factor in a word novel if it is the hrst occurrence in the word. 
The remaining palindromes (of the form ON-^O* or 0*2-^b*) must be centered at a position that 
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is a power of 4. It is not hard to see that if + n — 1] is a novel palindrome occurrence 
of this form in a, then b[i..i + n — 1] is also a novel palindrome occurrence of this form. 

On the other hand, for each k > 1, there are two palindromes that occur in b but not 
a. The first is of the form 01-^0 or 02-^0, since the corresponding factor of a is either 1 ■ • • 1 
or 2 • • • 2 , and hence has been previously accounted for Second, there is a factor of the form 
0 * 1 * 0 * or 0 * 2 * 0 * which appears as 20 * 1 * 0 * or 10 * 2 * 0 * in a, since the neighbouring block of 
I’s or 2’s is slightly wider and therefore slightly closer. We conclude that the length-n prefix 
of b has 21 og 4 n + 0 ( 1 ) more palindromes than the length-n prefix of a. 

Now suppose, contrary to what we want to prove, that the number of palindromes in the 
prefix of length n of a fc-automatic sequence is ^-synchronized. In particular, the sequence 
a (resp., b) is 4-automatic, so the number of palindromes in a[0..n — 1] (resp., b[0..n — 1] 
is 4-synchronized. Now, using a result of Carpi and Maggi [10, Prop. 2.1], the number of 
palindromes in b[l..n] minus the number of palindromes in a[l..n] is 4-synchronized. But 
from above this difference is 21 og 4 ? 7 , -|- 0(1), which by Lemma 4 cannot be 4-synchronized. 
This is a contradiction. □ 


7 Rich words 

As we have seen above, a word x is rich iff it has |x| -|- 1 distinct palindromic subwords. As 
stated, it does not seem easy to phrase this in first-order logic. Luckily, there is an alternative 
characterization of rich words, which can be found in [16, Prop. 3]: a word is rich if every 
prefix p oi w has a palindromic suffix s that occurs only once in p. This property can be 
stated as follows: 

RlCH(i, n) := Vm lN(m, 1, n) 

(3j SUBS(j, i, l,m) A PAL(j, i-f m - j) A -'OccURS(j, z, z d-m - j, m - 1)). 
Finally, we can express the property that x has a rich factor of length n as follows: 

3z RlCH(z,rz). 

Theorem 6 . (a) The Thue-Morse sequence contains exactly 161 distinct rich factors, the 
longest being of length 16. 

(b) The Rudin-Shapiro sequence contains exactly 975 distinct rich factors, the longest being 
of length 30. 

(c) The ordinary paperfolding sequence contains exactly fQf distinct rich factors, the longest 
being of length 23. 

(d) The period-doubling sequence has a rich factor of every length. In fact, every factor of 
the period-doubling sequence is rich. 

(e) Every factor of the Fibonacci sequence is rich. 

Of course, (e) was already well known. 
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8 Privileged words 

The recursive definition for privileged words given above in Section 2 is not obviously ex¬ 
pressible in first-order logic. However, we can prove a new, alternative characterization of 
these words, as follows: 

Let’s say a word w has property P if for all n, 1 < n < |tc|, there exists a word x such 
that 1 < |a;| < n, and x occurs exactly once in the first n symbols of tc, as a prefix, and x 
also occurs exactly once in the last n symbols of w, as a suffix. 

Lemma 7. If w is a bordered word with property P, then every border also has property P. 

Proof. Let z he a. border of w. Given any 1 < n < \z\, property P for w says that there 
exists a border x oi w such that 1 < |x| < n, and x occurs exactly once in the first (resp., 
last) n symbols in w. Then observe that the first (resp., last) n symbols of w are precisely 
the first (resp., last) n symbols of . 2 . Since x is also a border of z., it follows that z has 
property P. □ 

Theorem 8. A word w is privileged if and only if it has property P. 

Proof. If w is privileged, then, by definition, there is a sequence of privileged words w = 
wo,wi, ...,Wk-i,Wk such that \wk\ = 1 and for all i, Wi+i is a prefix and suffix of Wi and 
occurs nowhere else in Wi. Given an integer n, let x be the largest Wi such that \wi\ < n. 
Either i = 0 because n = \w\ and everything works out, or > n. Then Wi is a prefix 

of Wi-i (and therefore a prefix of w), and there is no other occurrence of Wi in Wi-i (which 
includes the first n symbols of w). Similarly, Wi is a suffix of tc, but does not occur again in 
the last n symbols of w. 

For the other direction, we assume the word has property P and use induction on the 
length of w. If |tc| = 1 then the word is privileged immediately. Otherwise, take n = \w\ — l 
and find the corresponding x promised by property P. Then x is both a prefix and a suffix 
of tc, so it has property P. It is also shorter than tc, so by induction, x is privileged. Then x 
is a privileged prefix and suffix of w which does not occur anywhere else in w (by property 
P), so tc is privileged. □ 

This property can be represented as a predicate in two different ways. First, let’s write 
a predicate that is true iff the prefix x[L.i -|- m — 1] occurs exactly once in x[L.i -|- n — 1]: 

UNlQUEPREF(i, m, n) := Vj lN(j, 1, n - m - 1) -'FactorEq( 2 , i -f j, m). 

There is a similar expression for whether the suffix x[i + n — m..i -|- n — 1] occurs exactly 
once in x[i..i -|- n — 1]: 

UNlQUESuFF(i,m,n) := Vj lN(j, 1, n —m —1) -iFACTOREQ(i-fn —m,i-fn —m—j,m). 

And finally, our first characterization of privileged words is 

PRIv(i, n) := (n < 1) V (VmlN(m, l,n) 

(3plN(p, l,m) A BoRDER(i,p, n) A UNlQUEPREF(i,p, m) A UNlQUESuFF(i+?7,-m,p, m))). 
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Alternatively, we can write 
Priv'(z, n) := (n < 1) V (Vm lN(m, 1, n) 

{3p In(p, l,m) A Border(z,p, n) A -'OccURS(i, i+l,p, m-1) A -'OccURS(i, i+n-m,p, m-1))). 

Theorem 9. (a) There is a AG-state automaton accepting the base-2 expansions of those n 
for which the Thue-Morse sequence has a privileged factor of length n. 

(b) There is an SA-state automaton accepting the base-2 expansions of those n for which the 
Rudin-Shapiro sequence has a privileged factor of length n. 

(c) There is a A7-state automaton accepting the base-2 expansions of those n for which the 
paperfolding sequence has a privileged factor of length n. 

(d) The set of n for which the period-doubling sequence has a privileged factor of length n is 

{0,2} U {2?7. + 1 : n > 0}. 

There is a A-state automaton accepting the base-2 expansions of those n for which the 
period-doubling sequence has a privileged factor of length n. It is illustrated below in 
Figure 2. 

(e) There is a 20-state automaton accepting the Zeckendorf representations of those pairs 
{i,n) for which i[i..i + n — 1] is privileged. It is illustrated below in Figure 3. The 
Fibonacci word has privileged factors of every length. If n is even there is exactly one 
privileged factor. If n is odd there are exactly two privileged factors. 

Remark 10. For (a)~(d) we used Priv and for (e) we used Priv'. 



Figure 2: Automaton for lengths of privileged factors of the period-doubling word 
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Figure 3: Automaton for privileged factors of the Fibonacci word 

We now turn to recovering some of the results of [25] on the number a{n) of privileged 
factors of the Thue-Morse sequence. Here are the hrst few values of this sequence 


n 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

a{n) 

1 

2 

2 

2 

2 

0 

4 

0 

8 

0 

8 

0 

4 

0 

0 

0 

0 


As we did above for closed words, we hrst make an automaton for the hrst occurrences of 


each privileged factor of length n. We then convert this to a linear representation 


obtaining 
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Ml = 


- 0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

. 0 0 


110 0 0 
0 0 0 1 1 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 10 0 
0 0 10 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 


0 0 0 0 
0 0 0 0 
10 0 0 
0 0 0 1 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 10 
0 0 10 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
10 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 


0 0 
0 0 
0 0 
1 0 
0 1 
0 0 
0 0 
0 0 
0 0 
0 1 
0 0 
0 0 
0 0 
0 1 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
0 0 
0 0 
0 0 
0 0 


0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
1 0 0 0 0 
0 0 0 1 1 
0 0 0 0 1 
0 0 0 0 1 
0 0 0 0 0 
1 0 0 0 0 
1 0 0 0 0 
0 0 0 0 0 
0 0 0 1 0 
1 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 1 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 1 0 
0 0 0 0 0 
0 0 0 1 0 
0 0 0 0 0 
1 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 


0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
10 0 0 
10 0 0 
0 0 0 0 
0 0 0 0 
0 10 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 10 0 
10 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 
0 10 0 
0 0 0 0 
0 0 0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 1 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 


U = [ll0010000000000000000000000000] 


l(;=[l0110000100011000100010001001l] 

We can then obtain relations for the seqnence (a(n))>o: 


a(4?7, + 3) 
a{8n + 1) 
a{8n + 5) 

a(16?7, + 6) 
a(16n + 8) 
a(16n + 10) 
a(16n + 12) 


a(4n + 1) 
a(4n + 1) 

0 

1 1 

a(4n + 1) + a(4?7, + 2) — -a(16n + 2) + -a(16n + 4) 

1 3 

3a(4n + 1) + 3a(4n + 2) — -a(16n + 2) — -a(16n + 4) 

1 3 

3a(4n + 1) + 3a(4n + 2) — -a(16n + 2) — -a(16n + 4) 

1 1 

a(4?7, + 1) + a(4?7, + 2) — -a{lQn + 2) + -a(16n + 4) 
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a(32n) = a(2n + 1) — -a(4n + 1) + 3a(8n + 2) — 3a{8n + 4) 

a{32n + 2) = —ra{2n + 1) + a(4n + 1) + 3a{8n + 2) — 2a(8?7, + 4) 
a{32n + 4) = —a{2n + 1) + a(4?7, + 1) + a{8n + 2) 
a{32n + 14) = —a{2n + 1) + a{8n + 4) 
a{32n + 16) = —a{2n + 1) + a{8n + 4) 
a{32n + 20) = a{32n + 18) 

a{32n + 30) = 2a{2n + 1) + a{8n + 2) — 3a{8n + 4) + 2a{8n + 6) — a{32n + 18) 
a(64?7, + 18) = a(4?7, + 1) 
a(64?7, + 50) = 0 

We can also do the same thing for the number of privileged palindromes {b{n))n>o in the 
Thue-Morse sequence. Here are the hrst few values: 


n 

0 

1 2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

b{n) 

1 

2 2 

2 

2 

0 

4 

0 

4 

0 

4 

0 

4 

0 

0 

0 

0 


We omit the details and just present the computed relations: 


&(4n + 3) 
b{8n + 1) 
b{8n + 4) 
b{8n + 5) 
b{16n + 6) 
b{16n + 8) 
5(16n + 10) 
&(16n + 14) 

b{32n) 

b{32n + 2) 
b{32n + 16) 
5(64n + 18) 
5(64n + 50) 


6(4n + 1) 

5(4n + 1) 

b{8n + 2) 

0 

fe(4n + 1) + &(4n + 2) 

b{An + 1) + b{An + 2) 

6(4n + 1) + b{An + 2) 

—6(4?7, + 1) + b{16n + 2) 

b{2n + 1) — ^&(4n + 1) 

—b{2n + 1) + b{An + 1) + b{8n + 2) 

-5(2n + 1) + 6(8n + 2) 

6(4n + 1) 

0 


9 Trapezoidal words 

Trapezoidal words have many different characterizations. The characterization that proves 
useful to us is the following [6, Prop. 2.8]: a word w is trapezoidal iff |tc| = R^, + Ky^. Here 
Rw is the minimal length ^ for which w contains no right-special factor of length and Ky, 
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is the minimal length ^ for which there is a length-^ suffix of w that appears nowhere else in 

w. 

This can be translated into Th(N, +, n —)■ x[n]) as follows: RTSp(j, n,p) is true iff x[j..j + 
n — 1] has a right special factor of length p, and false otherwise: 

RTSp(j, n,p) := 3r 3s (SUBS(r, + 1,77.) A SUBS(s, + 1, n) A 

FACTOREQ(r, s,p) A x[s + p] 7 ^ x[r + p]). 

MlNRT(j, n,p) is true iff p is the smallest integer such that x[j..j + n — 1] has no right 
special factor of length p: 

MlNRT(j, ?7,,p) := (-iRTSp(j, ?7,,p)) A (Vc (-'RTSp(j, n, c)) (c > p)). 

UNREPSuF(j, n, g) is true iff the suffix of length q of x[j..j + n — 1] is unrepeated in 
x[j..j + n- 1]: 

UNREPSUF(j, n, q) := -'OccURS(j + n - q,j,q,n - 1). 

MlNUNREPSuF(j, n,p) is true iff p is the length of the shortest unrepeated suffix of 

x. [j..j + n- 1]: 

MlNUNREPSUF(j, n,p) := UNREPSUF(j, n, g) A (Vc UNREPSUF(j, n, c) (c>g)). 

TRAP(j, n) is true iff x[j..j + n — 1] is trapezoidal: 

TRAP(j, n) := 3p 3g (n = p +g) A MlNUNREPSUF(j, ?7,,p) A MlNRT(j, n, g). 

Finally, we can determine those n for which x has a trapezoidal factor of length n as 
follows: 

3j TRAP(j,n). 

Theorem 11. (a) There are exactly 43 trapezoidal factors of the Thue-Morse sequence. The 
longest is of length 8. 

(b) There are exactly 185 trapezoidal factors of the Rudin-Shapiro sequence. The longest is 
of length 12. 

(c) There are exactly 51 trapezoidal factors of the ordinary paperfolding sequence. The 
longest is of length 8. 

(d) There are exactly 77 trapezoidal factors of the period-doubling sequence. The longest is 
of length 15. 

(e) Every factor of the Fibonacci word is trapezoidal. 

For parts (b) and (c) above, we used the least-signihcant-digit hrst representation in 
order to have the computation terminate. 
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10 Balanced words 


Our definition of balanced word above does not obviously lend itself to a definition in first- 
order arithmetic. However, for binary words, there is an alternative characterization (due 
to Coven and Hedlund [12]) that we can use: a binary word w is unbalanced if and only if 
there exists a palindrome v such that both OnO and Ivl are factors of w. 

Thus we can write dehne UNBAL(i,n), a predicate which is true iff x[h.i -|- n — 1] is 
unbalanced, as follows: 

3m (m > 2) A (3j 3/c (SUBS(j, i, m, n) A SUBS(/c, i, m, n) A PAL(j, m) 

A Pal(/c, m) A FACTOREQ(j + l,k + l,m-2) A x[j] ^ x[/c])) 

Theorem 12. (a) The Thue-Morse word has exactly balanced factors. The longest is of 
length 8. The Thue-Morse word has unbalanced factors of length n exactly when n > 4. 

(b) The Rudin-Shapiro word has exactly 157 balanced factors. The longest is of length 12. 
The Rudin-Shapiro word has unbalanced factors of length n exactly when n > 4. 

(c) The ordinary paperfolding word has exactly 51 balanced factors. The longest is of length 
8. The ordinary paperfolding word has unbalanced factors of length n exactly when n> A. 

(d) The period-doubling word has exactly 69 balanced factors. The longest is of length 15. 
The period-doubling word has unbalanced factors of length n exactly when n >6. 

(e) All factors of the Fibonacci word are balanced. 

Of course, (e) was already well known. 


11 Consequences 

As a consequence we get 

Theorem 13. Suppose ^ is a k-automatic seguence. Then 

(a) The characteristic seguence of those n for which x has a closed (resp., palindromic, 
maximal palindromic, privileged, rich, trapezoidal, balanced) factor of length n is k- 
automatic. 

(b) The seguence counting the number of closed (resp., palindromic, maximal palindromic, 
privileged, rich, trapezoidal, balanced) factors of length n is k-regular. 

(c) R is decidable, given a k-automatic seguence, whether it contains arbitrarily long closed 
(resp., palindromic, maximal palindromic, privileged, rich, trapezoidal, balanced) factors. 
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(d) There exists a function g{k,i,n) such that if a k-automatic sequence w taking values 
over an alphabet of size i, generated by an n-state automaton, has at least one closed 
(resp., palindromic, maximal palindromic, privileged, rich, trapezoidal, balanced) factor, 
then it has a factor of length < g{k, i, n). The function g does not depend on w. 

(e) There exists a function h{k, i, n) such that if a k-automatic sequence w taking values over 
an alphabet of size i, generated by an n-state automaton, has a closed (resp., palindromic, 
maximal palindromic, privileged, rich, trapezoidal, balanced) factor of length > h{k,£,n), 
then it has arbitrarily large such factors. The function h does not depend on w. 

Proof. Parts (a) and (c) follow from, for example, [26, Theorem 1]. For part (b) see [11]. 
Parts (d) and (e) follows from the construction converting the logical predicate for the 
property to an automaton. □ 
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